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Abstract 

A geometric inequality among three triangles, originating in circle packing prob- 
lems, is introduced. In order to prove it, we reduce the original formulation to the 
nonnegativity of a particular polynomial in four real indeterminates. Techniques based 
on sum of squares decompositions, semidefinite programming, and symmetry reduction 
are then applied to provide an easily verifiable nonnegativity certificate. 



1 Introduction 

In this paper we prove the following geometric inequality: suppose that we have three 
triangles. One with sides of lengths X, Y and Z, a second with sides of lengths U, V 
and W and a third triangle with sides of lengths {X + U), (Y + V) and (Z + W). Let 
us denote by a the angle in the first triangle between the sides of lengths X and Y. 
Let (5 be the corresponding angle in the second triangle between U and V and let 7 be 
the corresponding angle between (X + U) and (Y + V) in the third triangle. Then 

a ■ (X + Y - Z) + ■ (U + V - W) < 7 • {(X + U) + (Y + V) - {Z + W)). 

It turns out that proving this inequality is not at all simple. The need for this inequality 
originates in [HJ] . This last paper describes a new approach to circle packings |^, |llj . 
The main features of this approach are the theory of Perron- Frobenius for non-negative 
matrices, ||, 13] and fixed-point theory, ||, 0|. In particular our approach uses 
a converse of the contraction principle as appears in [2|. A central object that is 



, of a graph embedding. 



introduced in |]To| is the a-mapping, /„ : IR + '^' — > IR + ^' 
This is a variant of Thurston's relaxation mapping. A key property of the a-mapping, 
fa, is its super-additivity, 

Vr,s G Mr) + fc(s) < fa(r + s). 

It turns out that this property is implied by the above geometric inequality. A very 
interesting feature of our proof of this inequality is the use of semidefinite programming 
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based algorithms for producing representations of non-negative polynomials as sums 
of squares |4], ||. 

We make a reduction of the original inequality to the non-negativity of a certain 
real polynomial of degree 20 in 4 real indeterminates. Although instances of this size 
are near the limits of what can be achieved using generic methods, our particular poly- 
nomial enjoys certain convenient sparsity and symmetry properties. These allow the 
use of new algorithms, introduced in ||, customized for polynomials with an invariant 
structure. Even though the computational procedures in its current form use floating 
point arithmetic to arrive at the result, the final solution can be easily verified in a 
completely independent fashion. The methods work quite nicely in the problem at 
hand, producing a concise representation of the polynomial as a sum of five squares, 
thereby concluding the proof. 



The theorem has two simple geometric interpretations: 

(1) Three circles of radii R, a and b that are mutually tangent to one another from 
the outside form an Euclidean triangle. The vertices of the triangle are the centers 
of the circles. The sides of the triangle have the following lengths: R + a, a + b and 
R + b. Similarly three circles of radii S, c and d form a triangle of sides S + c, c + d and 
S + d. Finally, a third such triangle is formed by three circles of radii R + S + a + c, 
a + c + b + d and R + S + b + d. We note that the sides of the third triangle have 
lengths which are the sums of the corresponding sides of the first two triangles. On 
the other hand the three sets of triples of circles form also three circular triangles. The 
vertices of these triangles are the tangency points of pairs of circles in each triple. The 
lemma implies that the circular sides of the third (largest) triangle are greater than 
or equal to the sums of the corresponding circular sides of the first two circular triangles. 

(2) Let us consider three Euclidean triangles. One with sides of lengths X, Y and 
Z, and an angle a between X and Y. A second triangle with sides of lengths U, V 
and W, and an angle (5 between U and V. A third triangle with sides of lengths 



(X + U),(Y + V) and (Z + W), and an angle 7 between (X + U) and (Y + V). Then 
a ■ (X + Y - Z) + ■ (U + V - W) < 7 • ({X + U) + (Y + V) - [Z + W)). 



2 The super-additivity of fa 

As proved in jjOJ], the super-additivity of fa follows from 



Theorem 1 If a,b,c,d,R,S > 0, then 
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3 Reducing to algebraic inequalities 



We now make a reduction of the inequality of Theorem 1. There are two ideas involved 
in it. The first idea is summarized in the following, 

Lemma 1 Suppose that there exists a twice differentiable, surjective and strictly in- 
creasing function f : I — > [0, 1] which satisfies the following two conditions: 



(1) 

(2) 



f (l"/ 2 )+/-(/) 2 <0 onl. 



R + S) J \\ {R + a)(R + b) J \R + SJ J \\ {S + c){S + d) 



(a + c)(b + d) 



{R + S + a + c)(R + S + b + d)J ' 
for all a, b,c,d, R, S > 0. Then, the inequality of Theorem 1 holds true 

Proof. 

Consider the function y = sin -1 f(x) for x G I. Then, 

dy f 



dx yi^p 

d 2 y _ f"(l-f 2 ) + f-(f') 2 

dx* (1 _ /2)3/2 

By condition (1) we get d 2 y/dx 2 < on I and hence y is concave in I. So for any 
x,z e I and for any < t < 1, we have 

isin _1 f(x) + (1 - t) sin.- 1 f(z) < sin' 1 f(tx+ (1 - t)z). (1) 

We make the following choice, 



(R + a)(i? + b) J ' ' \J (5 + c)(S + d) / ' \R + S 
Then, by inequality (1) we get, 



+ IV (i? + a)(i? + 6) 



+ Uts) sm " (\/ (5 + cK5 + (i ) J Ssm " / < te + (1 -»'- 

By condition (2) we have, 

< f- 1 



(a + c)(6 + d) 



(i? + S + a + c)(i? + S + b + d) 
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and since / is increasing and also sin 1 is increasing, we get, 



Theorem 1 follows by inequalities (2) and (3). O 
special cases. 

(I) f(x) = sinx, I = [0, vr/2]. Then in this case we have, 

/ (1 — f 2 ) + / • (/ ) 2 = — sinx cos 2 x + sinx cos 2 x = 

and condition (1) of the theorem is satisfied. Condition (2) is the inequality of Theorem 
1 and so the theorem is correct trivially in this case. 



(II) f(x) = 1 — 1/x, I = [1, oo]. Then in this case we have, 

2 

X J I \ X I X 



1 3 
= — - — < 0, 

x° x 4 

for x > 1. So condition (1) is satisfied. Condition (2) and the conclusion of the theorem 
prove the following, 

Lemma 2 If for every a, b, c,d,R,S>0 the following inequality is true, 
R 



R + SJ \l-^/(ab)/[{R + a){R + b)\ 



+ 



+ {r + s) (l-V(cd)/[(5 + c)(S + d)]J " 

1 

< 



1 - y/[(a + c)(6 + 3J17P + 5 + a + c)(i? + 5 + b + d)\ ' 
i/ien, i/ie inequality of Theorem 1 is true. 

The second idea in this approach (after that of Lemma 1) is an elementary trick to get 
rid of the square root functions in Lemma 2. Let us denote, 



R + a 1 V R + h ' VS + c yS + d' 

Then < a, /3, 7, <5 < 1. Also a, /3 are independent except for a = 1 iff /3 = 1. That 
happens only if R = 0. 7, 5 are independent except for 7 = 1 iff (5 = 1. That happens 
only if S = 0. For the inverse transformations we have, 
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With these notations, the left hand side of the inequality in Lemma 2 is, 

( R \ ( 1 \ R(l-7S) + S(l-a(3) 

\R + SJ \l-ap) \R + SJ \l-jSJ (R + S)(l-a(3){l-j5)' 

As for the right hand side, we have, 



a + c 



R+ S + a+ c 



b + d 



a 2 {l 


-7 2 )i? + 7 2 (l 


-a 2 )S 


(1 


-7 2 )«+(l- 


a 2 )S 


'/3 2 (1 


-S 2 )R + S 2 (1 


-P 2 )S 




-5 2 )R+(1- 


p 2 )s ■ 



_ R+S+b+d 
Plugging these into the inequality of Lemma 2 we get, 

#(1 - 7<5) + S(l - a/3) 1 



(R + S)(1- a/3)(l - 7<5) " 1 - hh ' 
Hence 

ifa/3(l - 7(5) + SjSjl - a/3) 
1 2 " - 7 <5) + 5(1 - a/3) ' 

On squaring both sides we conclude that in order to prove Theorem 1, it suffices to 
prove the following, 

Lemma 3 If R, S > and < a, (3, 7, <5 < 1, then, 

/ V(l- 7 2 )^ + 7 2 (l-Q 2 )<g \ / /? 2 (l-(5 2 )fl + (5 2 (l-/3 2 )ff \ 
^ (l- 7 2 )i? + (l-a 2 )S ) { (1 - <5 2 )i? + (1 - (3 2 )S )~ 

/Raf3(l - 7<5) + 57<5(1 - a/3) \ 2 
- I S(l - 7*) + S(l - af3) ) ' 

Proof: Let us define, 

'a 2 (l - 7 2 )i? + 7 2 (1 - a 2 )S\ ( (3 2 (1 - 5 2 )R + 7 2 (1 - a 2 )S\ 



E 



(1 - 7 2 )i? + (1 - a 2 )S ) \ (1 - 5 2 )R + (1 - a 2 )S 

^ / Rapjl - 7 (5) + ^7(5(1 - a/3) \ 2 
~ V R(l - jS) + S{1 - af3) ) " 



Then, 

RS(R + S)[(l - 7<5)L -R+(l- af3)M ■ S] 



E 



[(1 - 7 2 )i? + (1 - a 2 )S] [(1 - 5 2 )R + (1 - a 2 )S] [(1 - 7 <5)i? + (1 - ap)S] 2 ' 
where we have, 

L = a 2 f3 2 {a - (if + (a - P) 2 j 3 5 3 + 
+{(a(5) 2 (l - a/3)(l + ap- 2/3 2 ) - (aS)(pj)(2 - 4a/3 + pa 3 + ap 3 )+ 
+ (/? 7 ) 2 (l - a/3)(l + aP- 2a 2 )}+ 
+{ 7 2 /?(l - a/?)(2a - /3 - a/3 2 ) - 7< 5(a 2 + /3 2 + 2a 3 /3 3 - 4a 2 /? 2 ) + 
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+5 2 a(l - a(3)(2[3 -a- a 2 /?)} 7 (5, 
or as a polynomial in 7 and 5, 

L = a 2 f3 2 {a - (3) 2 + (3 2 (1 - a(3)(l + a(3 - 2a 2 ) 7 2 + 

+a 2 (l - a(3)(l + a(3- 2(3 2 )5 2 - a/3 (2 + a(3 3 - 4a(3 + (3a 3 )^5+ 

+(3(1 - a/3) (2a -(3- a/3 2 ) 7 3 <5 - (a 2 + (3 2 + 2a 3 /3 3 - 4a 2 /? 2 ) 7 2 <5 2 + 

+a(l - a/3) (2/9 - a - a 2 /3) 7 5 3 + (a - /3)V<5 3 , 

and where M = M(a, /3, 7 , 5) = L( 7 , 5, a, /?). Thus it suffices to prove that for any < 
a, (3,j,6 < 1 we have L(a,{3, 7,6) > 0. For this will also imply that M(a,(3, 7, <5) > 
for any such a choice. This, in turn, will show that £7 > for every choice of R, S > 
and < 0,13,^,5 < 1 and hence will prove Lemma 3. To check the non-negativity of 
L(a, (3,7,6) we make the substitutions 

x 2 y 2 z 2 w 2 

(a, (3,7,6) = (— — 2>T— — 2'7~^ — 2'7~^ — 2) ( 4 ) 
1 + x A 1 + 2/ 1 + z 1 + ur 

and clear the denominators. This will give us a polynomial in ]R[x, y, z, w\. In fact, 
this polynomial is 

P(x,y,z,w) = 

= L (-^- 2 , -^- 2 , -^- 2 , -^V) (1 + x 2 )\l + y 2 )\l + z 2 ) 3 (l + „ 2 ) 3 . (5) 
\l + x z 1 + y 1 + 2 I + w y 

As a consequence, it suffices to check that y, z, w) is non-negative for all real values 
of its indeterminates. We conclude the proof of Lemma 3 in the following section, after 
a brief detour explaining the sum of squares based methods we have used. 



4 Sums of squares 

An obvious sufficient condition for non-negativity is to represent P(x,y,z,w) as a 
sum of squares of real polynomials. The connections between sums of squares and 
non-negativity have been extensively studied since the end of the 19th century, when 
Hilbert showed that in the general case the two conditions are not equivalent. We 
refer the reader to the wonderful survey Jl^] by Reznick on the available results and 
history of Hilbert's 17th problem. In the work of Choi, Lam, and Reznick [|J] the alge- 
braic structure of sums of squares decompositions is fully analyzed, and the important 
"Gram matrix" method is introduced. On the computational side, convex optimiza- 
tion approaches to this problem originate in the early work of Shor fll4|| . Recently, 
efficient techniques using semidefinite programming and exploiting problem structure 
have been developed in [^, A brief description of the methods follows, referring the 
reader to the cited works, and the references therein, for the full algorithmic details. 

We explain next the general idea of the Gram matrix method. Given a multivariate 
polynomial F(x) for which we want to decide whether a sum of squares decomposition 
exists, we attempt to express it as a quadratic form in a new set of variables u. A 
judicious choice of these new variables will depend on both the sparsity structure and 
symmetry properties of F ||. For instance, for the simplest case of a generic dense 
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polynomial of total degree 2d, the variables u will be all the monomials (in the variables 
x) of degree less than or equal to d. Consequently, we try to represent F(x) as: 



F(x) = u T Qu 



(6) 



where Q is a constant matrix. Since in general the variables u will not be algebraically 
independent, the matrix Q in the representation @ is not unique. In fact, there is 
an afSne subspace of matrices Q that satisfy the equality, as can be easily seen by 
expanding the right-hand side and equating term by term. If in the representation 
above the matrix Q can be chosen to be positive semidefinite, then a factorization of 
the matrix Q directly provides a sum of squares decomposition of F(x). Conversely, if 
F is a sum of squares, then such a Q can always be constructed by expanding the terms 
in monomials. Therefore, the problem of checking if a polynomial can be decomposed 
as a sum of squares is equivalent to verifying whether a certain affine matrix subspace 
intersects the cone of positive definite matrices. This latter class of convex optimization 
problems is known as semidefinite programs (SDP) fl6| , and can be efficiently solved 
using a variety of numerical algorithms, mainly based on interior point methods. 

Example 1 Consider the quartic form in two variables described below, and define 
u = [x 2 ,y 2 ,xy] T . 



F(x,y) 



2x 4 + 2x 3 y - x 2 y 2 + 5y 4 





T - 


X 2 








y 2 




. xy _ 





<?13 




X 2 


<?23 




y 2 


<733 _ 




. xy . 



= qnx 4 + <?22y 4 + (<733 + 2<7i2)£ V + 2q 13 x 3 y + 2q 23 xy 3 

Therefore, in order to have an identity, the following linear equalities should hold: 

q u = 2, q 2 2 = 5, q 33 + 2q 12 = -1, 2q 13 = 2, 2q 23 = 0. (7) 

A positive semidefinite Q that satisfies the linear equalities can then be found using 
semidefinite programming. A particular solution is given by: 



Q 



-3 1 
5 
5 



L T L, 



L 



1 

72 



-3 1 
1 3 



and therefore we have the sum of squares decomposition: 

F(x, y) = X -{2x 2 - 3y 2 + xy) 2 + \{y 2 + 3xy) 2 . 



Back to our concrete problem, the polynomial P in (|5|) has four variables, 123 
nonzero monomial terms, and total degree 20. Notice that a polynomial of that degree 
and number of variables generically has ( 20 ^ 4 ) = 10626 monomials, so P is quite sparse. 
In particular, it has a bipartite structure, with degree 12 in x, y, and degree 8 in z, w. 
Furthermore, P has very appealing symmetry properties: some inherited from L, and 
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some as a result of the substitution (|4|). Concretely, it is easy to see that P is invariant 
under the transformations: 

(x,y,z,w) -> (y,x,w,z) (8) 
(±x,±y,±z,±w) (9) 

The first property is a clear consequence of the symmetry of our original geometric in- 
equality with respect to interchange of the two triangles. The second one is a side effect 
of our choice for modeling the nonnegativity constraints. The transformations given 
above generate a symmetry group G with 32 elements and 14 irreducible representa- 
tions: eight one-dimensional and six two-dimensional (8 • l 2 + 6 • 2 2 = 32). As explained 
extensively by Gatermann and Parrilo in [EJ, these symmetries can be exploited very 
successfully in reducing the computational requirements. 

To do this, the approach in || relies on a crucial property of convex optimization 
problems invariant under a group action, namely the fact that the optimal solution can 
always be restricted to the fixed-point subspace. Using Schur's lemma of representation 
theory, it is shown there that by using an appropriate symmetry-adapted coordinate 
transformation, the original semidefinite program can be decomposed into a collection 
of smaller coupled problems, of cardinality equal to the number of irreducible repre- 
sentations of the group. This reduces both the size and the number of variables in the 
problem, and as a consequence notably enhances both the accuracy and conditioning 
of the solution. 

Attempting to directly establish the nonnegativity of P without taking into account 
both the sparsity and symmetries can be a difficult (or even impossible) task for current 
SDP solvers, both in terms of memory requirements and accuracy. A naive approach, 
using only degree information but no structure whatsoever, would require solving a 
semidefinite program of dimension 1001 x 1001 and 10626 constraints. By exploiting 
only the sparsity of P, but not its symmetry, the problem is reduced to dimension 137 x 
137 and 1328 constraints. Adding the simplifications resulting from the symmetries, 
the problem is further simplified to a much more manageable one with 14 coupled 
SDP (one for each irreducible representation), of dimensions ranging between 2x2 
and 11 x 11 (see Table |]). For instance, for the trivial irreducible representation (# 1 
in the table), the corresponding new variables u are invariant under the group action, 
and given by: 

y 2 z 2 + x 2 w 2 

x 2 z 2 w 2 + y 2 z 2 w 2 , y 4 z 2 + x 4 w 2 , x 2 y 2 z 2 + x 2 y 2 w 2 , x 4 y 2 + x 2 y 4 
x 2 y 2 z 2 w 2 , x 4 z 2 w 2 + y 4 z 2 w 2 , x 2 y 4 z 2 + x 4 y 2 w 2 , x 4 y 2 z 2 + x 2 y 4 w 2 . 

Notice in Table p] that the combined total, taking into account multiplicities, is equal 
to 137, the dimension of the sparse version of the problem. 

The resulting system of matrix inequalities can be solved with standard SDP solvers, 
such as SeDuMi [15]. The output provides a decomposition of P as a sum of squares 



of polynomials, with coefficients given by floating point numbers. In this particular 
case, the computed values immediately suggest the existence of a solution, presented 
below, with polynomials having integer coefficients. The solution can be verified in a 
completely independent fashion, providing a mathematically correct certificate of the 
nonnegativity of the polynomial P. 
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Irr. Rep. # 


1 


2 


3 


4 5 


6 


7 8 


9 


10 11 


12 13 


14 


Multiplicity 


1 


1 


1 


1 1 


1 


1 1 


2 


2 2 


2 2 


2 


Dim. SDP 


9 


6 


6 


4 8 


5 


3 2 


11 


7 8 


7 8 


6 



Table 1: Irreducible representations of G and the corresponding SDP dimensions. 
Proof of Lemma 3: (continued) Consider the following three polynomials: 

a/ \ 2 2 42,2 2,^22 2 n 2 2 2 24 

A(x, y,z,w) = —y z — y z + x w + 2x y w — 2x y z — x y — 

o 2 4 2 , 4 2, 4 2 , o 4 2 2 

—2xyz + x w + x y + 2x y w , 

d/ \ /i , 2 i 2\/ 22 222 222, 222, 

B(x, y, z, w) = (1 + x + y ){—x w — x z w — x y w + x y z + 

,22, 2 2 2\ 
+y Z +y Z W ) 

and 

\ / x / . x / 222,22,222,222 

G (x, y, z, = [x — y)(x + y)\—x z w + x y + x y w + x y z — 

2 2 2 2 2\ 

—z w — y z w ). 
Then we have the following identity 

P(x, y, z, w) = A(x, y, z, w) 2 (z 2 + w 2 + 2z 2 w 2 ) + B(x, y, z, w) 2 + C(x, y, z, w) 2 . 

Thus P(x, y, z, w) is a sum of five squares of real polynomials and the proof of Lemma 
3 is completed. O 

Rewriting the obtained sum of squares decomposition in terms of the original vari- 
ables, the following representation of L can be obtained: 

L = L\ + L2 + L3 

Li = + 5)(-a 2 p + aft 2 - a5 + - (3j5 + a5-y - a(3 2 -f + a 2 (35) 2 
L 2 = (i- 7 )(i-<5)(a/3-l) 2 (a<5-/?7) 2 

L 3 = (l- 7 )(l-5)(«-/3) 2 (a/3-75) 2 . 

From this, stronger conclusions on the sign of L can be immediately derived: not only 
it is nonnegative on the open unit hypercube (0, l) 4 as needed for Lemma 3, but the 
same property holds on the much larger region IR x IR x {7 + 8 > 0, (1 — 7)(1 — 5) > 0}. 

Acknowledgments: We would like to warmly acknowledge the help of Bruce Reznick, 
who made possible this collaboration by introducing the authors to each other's work. 
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